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Abstract 

We introduce a generalized thermostatistics based on Kolmogor- 
ov-Nagumo averages and appropriately selected information mea- 
sures. The formalism includes Tsalhs non-extensive thermostatis- 
tics, but also extensive thermostatistics based on Renyi entropy. The 
Curie- Weiss model is discussed as an example. 
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1 Introduction 

We present a thermostatistical theory based on the notion of Kolmogorov- 
Nagumo (KN) mean It deals with the problem of maximizing average 
information content under the constraint that the (nonlinear) average of 
some energy function has a given value. 

From the point of view of information theory and the principle of max- 
imal entropy there are at least two ways to generalize the Boltzmann- 
Gibbs formalism of statistical mechanics. One can modify the definition of 
entropy/information, or one can modify the definition of average energy. 
We will make use of both possibilities. Many definitions of entropy can 
be found in literature []l0| . In contrast, KN-averages, although known in 
statistics since many years, appear not to be used in the context of physics. 
Therefore, we will discuss in detail how they fit with basic principles of sta- 
tistical mechanics. 

*Departement Natuurkunde, Universiteit Antwerpen UIA, Univcrsiteitsplein 1, 
B2610 Antwerpen, Belgium, E-mail: Jan. NaudtsOua. ac.be 

^Katedra Fizyki Teoretycznej i Metod Matematycznych, Politechnika Gdanska, 80- 
952 Gdansk, Poland, E-mail: mczachor@pg.gda.pl 



1 



Originally, our plan was to generalize the Boltzmann-Gibbs formalism 
by replacing Shannon's entropy by Renyi's a-entropies, and by simulta- 
neously replacing the usual linear statistical average by an appropriate 
KN-average. The resulting formalism is what we call Renyi thermostatis- 
tics in the sequel. However, the formalism allows further generalization 
which consists of replacing Renyi entropies by equivalent entropies, each 
time choosing appropriate KN-averages. By equivalent entropies we mean 
monotonically increasing continuous functions of Renyi entropies. Indeed, 
replacing an entropy by an equivalent one does not change the maximum 
entropy problem. One such entropy which is equivalent with Renyi's, and 
which is quoted often in literature, is Tsallis' entropy studied ear- 
lier by Harvda and Charvat and by Daroczy 0. The thermostatistics 
which we propose is not equivalent with the non-extensive thermostatistics 
proposed by the Tsallis school because of the non-linear averages used in 
the present paper. Renyi thermostatistics is extensive, as we will show. 
The more general formalism can be non-extensive, and, in fact, comprises 
Tsallis' thermostatistics as a special case. 

Note that throughout the paper units are used in which Boltzmann's 
constant fcs equals one. 

The paper is organized as follows. In section 2 the general formalism 
is explained. The standard Boltzmann-Gibbs formalism and some other 
formalisms are shown to be special cases. In section 3 equilibrium distri- 
butions are discussed. Special attention is paid to the definition of ther- 
modynamic temperature. Section 4 shows how nonlinear averages fit with 
the fundaments of statistical mechanics. The Curie- Weiss model is treated 
as an example. Section 5 deals with Renyi thermostatistics and its obvious 
non-extensive generalization. The Curie- Weiss model serves again as an 
example, together with the two-level system. The last section contains a 
short discussion of our results. Throughout the paper we use deformed 
exponential and logarithmic functions with a definition which is broader 
than that found in literature. Details about these are found in appendix. 

2 A general formalism 
2.1 Optimization problem 

For simplicity we restrict ourselves initially to functions /(fc) with integer 
fc = 0, 1, 2, • • •. We use the index notation /j.. 

The Kolmogorov-Nagumo (KN) average ^ depends on a monoton- 
ically increasing function (t>{x) and on parameters pk- Essential for the 
present approach is that the latter are probabilities satisfying Pfc > and 
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'^f.Pk = 1- The definition is 



{f) = ^-' {j2pkHfk)^ (1) 

We will not make use of the possibility that (/> may be decreasing instead 
of increasing. 

Fix an additional monotonically increasing function a;(x). It is used to 
define information content Ik by 

Ik=u;{l/pk). (2) 

Because lo is increasing, less probable events (i.e. smaller p^) carry more 
information. The average information content is now given by 



{i) = r'\y2PkH^ii/Pk))]. (3) 



Generalized thermostatistics studies the problem of maximizing average 
information content under the condition that average values of functions 
/i (x) fn (x) have predetermined values yi , • • • , ?;„ 

{f,)^yj forj = l,2,...,n. (4) 

Throughout the paper we consider the optimization problem with ex- 
actly one constraint, i.e., we fix some real function PoEk and optimize 
information content under the condition that {(ioE) has a predetermined 
value (3oU. It is tradition to call Ek the energy functional and to call U the 
internal energy. Note that we introduce here a constant /3o with dimension 
of inverse energy. Its first purpose is to make quantities dimensionless. Its 
actual role in the theory will be clarified later on. The maximum attained 
by (/) is denoted S{U) and is called thermodynamic entropy (not to be 
confused with the entropy functionals introduced below). The variation is 
with respect to the choice of probabilities pfc . In principle, it might be inter- 
esting to vary also the choice of functions (j){x) resp. lo{x). This extension 
is not considered in the present paper. 



2.2 Variation principle 

It is tradition Q to solve this kind of optimization problem by introduction 
of two Lagrange parameters 7 and (3, one to ensure normalization of the 
probabilities pk, the other to enforce the energy constraint. Since is a 
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monotonically increasing function an equivalent optimization problem is to 
minimize a free energy of the form 



k 



The set of conditions for an extremum reads 



= ^HiPoE)) - § ^m)) - 7- (6) 

OPk P OPk 

Using (P and (^) this simplifies to 

^ [pkHcuil/p,))] = ^ imEk) - 7) • (7) 
opk Po 

This equation has to be solved to obtain as a function of 7 and (3. It 
is not easy to progress with a problem of this generality. Let us therefore 
consider first some important examples. 



2.3 Boltzmann's entropy 

Consider first the rather trivial case that the energy function Ek is constant 
and that the index k takes on a finite number 17 of values. Then entropy 
is optimal with 

Pk = ^. (8) 
This implies immediately the result 

S^oj{^). (9) 

With ll}{x) replaced by fc^ ln(a;) this is the famous result obtained by Boltz- 
mann in the nineteenth century. 



2.4 Shannon's entropy 

Let 4>{x) = X and lo{x) — \og^^(x). With this choice of lo the information 
measure is 

4=-logbPfc. (10) 

It represents Hartley's notion of information |Q] . The expression for average 
information content becomes 

(/> =-Y.Pk \og,pu ^ S^'^—ip). (11) 

k 
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This expression is called Shannon's entropy functional Q] . Eq 

13 

- 1 - log(,pfe = PEk - —7. 

Po 

This implies 

Pfc = ^ expb(-/3£:fe) 
with normalisation factor Z given by 

Z = ^exp,(-/3i?fe). (14) 

k 

This is the Boltzmann-Gibbs distribution. For simplicity, only the base 
b = e will be used in the sequel. The notations lna{-) and exp^(-) will 
be used from now on with the different meaning of a-deformed logarithm 
resp. exponential. 



. (0) becomes 
(12) 

(13) 



2.5 Tsallis' entropy functional 

Fix some a > 0, a ^ 1, and let 4>{x) — x and ^{x) — lnQ,(a;) where 

ln„(a;) = — x > 1. (15) 

1 — a 

We use here the a-deformed logarithm (|lj, |2j], see also the appendix). 
In the limit a = 1 it reduces to the natural logarithm. Its inverse is the 
deformed exponential 

exp„(a;) = [l + (l-a).x]+'''^""', x>0. (16) 

Throughout the text [x]+ equals the maximum of x and 0. Note that 
the present example is an obvious generalization of the previous one. The 
average information content is 

(/) = E?"^-l^"(VPfc) = ^fl^^ ^ 5™(p). (17) 

fe 

This is Tsahis' entropy §, 0, |l|. Eq. (0) becomes 

^(.pr-l)=/Ji..-A,. (18) 

It can be written as 

Pk^ -il + {l-a)p{Ek-j/Po)) ■ (19) 

\_a J 

which means that pk is inversely proportional to exp^lpEk). This solution, 
with some modification, will be discussed in more detail further on. 
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2.6 Renyi's entropy 



It is well-known |g| that averaging information /, as given by (10), leads to 
Renyi's entropy of order a, if the function (jj is chosen of the form (/'(x) = 
exp((l — a)x), {a ^ 1). Let us make an innocent modification to this 
function and let 6 — (h„ with 



(l)a{x) = InQ(e^) 



1 



a: > 0. (20) 



Note that the KN-mean is invariant under this kind of modification. One 
obtains (assuming uj{x) = ln{x) as in the Shannon case) 



(I) 



^ 5«™^'(P) (21) 



The latter is the usual expression for Renyi's entropy of order a. 

It is now straightforward to solve (^ for the present choice of (p and uj. 
However, note that in course of calculation ( ^ ) the expression for Tsallis' 
entropy ( p7| ) appears. This shows that in the context of KN-means, there 
is an intrinsic relation between the entropies of Renyi and of Tsallis (this 
relation seems to be known since quite some time — see |33|). Instead of 
maximizing Renyi's entropy under the constraint that (PqE) = (3oU one 
can as well maximize Tsallis' entropy 0q((/)) under the constraint that 
4>a{{PaE)) — (f>a{(3oU). In other words, the present example is reduced to 
the previous one. 



2.7 Equivalent entropies 

Note that there exist other pairs (</>, lu) which arc linked to Tsallis entropy 
in a way similar to the previous example. This is the case whenever 

</)(w(x)) =ln„(x) (22) 

for some a 7^ 1 . One can see (^2|) as a definition of uj in case (f> is given, or 
as a constraint on if is given. 

Condition (|2^) implies that (p maps average information content (/) 
onto Tsallis' entropy. Hence all entropies in this class are equivalent in the 
sense that they are increasing functions of Tsallis' a-entropy or, if you wish. 
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of Renyi's a-entropy. Note that the result of the optimization problem does 
not change if the entropy is replaced by a monotonic function of the entropy. 
Hence all what happens is that for a given (nonlinear) constraint we adapt 
the a-entropy to an equivalent entropy which is technically better adapted 
for solving the optimization problem. 



3 Solutions 

In this section a is fixed and and oj are related by (|2^). 

The original optimization problem is reformulated as maximizing Tsal- 
lis' entropy S'„^^''"'(p) under the constraint 

Y,Pk^iPoEk)^'Pil3oU) (23) 
fc 



3.1 Optimization results 

The formulas that follow are based on results already found in literature 
in many places, e.g. in H^. 

The equilibrium average is the KN-mean (|l|) with pk given by 

Pk = ^[l + ail - a)ixk - u)]'/^"-''> = 1- e^vj~a{xk - u)) (24) 
with a > and u < Xf^ for all k, with normalization Zi given by 

Z,=Y}^ + a{l-a){xk-uf_l^''-^^ (25) 

k 

and with Xk = (f>{PoEk). The unknown parameters a and u have to be fixed 
in such a way that ( p3[ ) holds. This condition can be written as 

a[l — a) \Zi J 

with Zq given by 

Zo^J2^l + ail - a){xk - u)r/''~'\ (27) 

k 

There is only one condition to determine the two parameters a and u. A 
convenient way to fix u is to take it equal to the lower bound of the Xk (we 
assume that Xk is bounded from below). In that case one can guarantee 
[ p6| that the solution of the optimization problem, if it exists, occurs for 
a > 0. 



7 



The entropy S{U) — (/)max follows from ( p7| ) using (p4[). One obtains 

m = T^(^-i)- (28) 



1 - a V 

Expression ( p^ ) with < a < 1 is related to the kappa-distribution or 
generalized Lorentzian distribution |Q 

[1 + av'l 

It is frequently used, e.g. in plasma physics to describe an excess of highly 
energetic particles 



Typical for distribution (24) with a > 1 is that the probabilities pk 
are identically zero whenever a{a — l){ip{Ek) — u) > 1. This cut-off for 
high values of Ek is of interest in many areas of physics. In astrophysics 
it has been used to describe stellar systems with finite average mass. 
A statistical description of an electron captured in a Coulomb potential 
requires the cut-off to mask scattering states [|2[ ^ . In standard statistical 
mechanics the treatment of vanishing probabilities requires infinite energies 
which lead to ambiguities. These can be avoided if distributions of the type 
M), with a > 1, are used. 



3.2 A thermodynamic relation 

For further use we calculate now the derivative of S w.r.t U. Since the 
equilibrium state depends only on the fee parameter a (u may be kept 
constant) we start with calculating the dependence of S on a. 
Equations (^6|) and ( p8|) can be written as 

^ = l + {l-a)cbiS) 
^1 

^ = 1 + a{l - a){<f>{l3oU) - u) (30) 

Hence, ( p8| ) can be written as 

HS) = ([1 + a(l - amPoU) - uTZ^-^ - l) (31) 

L — a 

On the other hand, (^ ) implies 

±Zo = --^-{Zo-Z,) (32) 
da 1 — a a 

Combining these two expressions one obtains 

j.tQ\ 1 + (1 - a)(j){S) d 

—0 5' = aa— — z-t-(P(PqU) 33) 

da 1 + a(l ~ a)[(p[poU ) — u) da 
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The final result is then 

J_ AS _ / 
Po dU da I da 

c^'iPoU) / d0(5) / d(t>{f3oU) 
(f)'{S) \ da / da 
^'W) l + {\-amS) 

I + a{l - a){^{l3oU) - u) ^ ' 

Note that a in this expression is still a function of U. 

3.3 Thermodynamic temperature 

The standard thcrmodynamical relation for temperature T is 

]__dS_ 
T~dU' 

In generalized thcrmostatistics this definition of temperature is not nec- 
essarily correct |2g|. The problem is that the thermodynamic definition 
of temperature (35) is not invariant under substitution of entropy by an 
equivalent entropy. This means that one can generalize definition ( p5| ) to 

1 d/(5') fffo^^^ 

where f{x) is any strictly increasing function of x i.e. the derivative f'{x) 
is positive. However, not all choices of f{x) are physically acceptable. 
Minimal requirements are that T is an increasing function of U, and that 
T = corresponds with U = i?ni, where = mink E^. 



(35) 



The formalism of Tsallis' thermostatistics|12 has been modified a few 



times [|15L In particular, the correct definition of temperature has 



been a difficult point |29|, 31, One can easily verify that the proposed 
definitions of thermodynamic temperature are of the form ( p6| ) . As argued 
in ( ^5| ) is correct in case the entropy is extensive. Only Renyi's a- 
entropies are extensive - see the next section. Hence we propose that f{x) 
should be fixed in such a way that f{S) is Renyi's entropy. This is the case 
if 

f{x) = ln(exp„(0(a;))) - ln(l + (1 - a)0(x)) (37) 

1 — a 

Indeed, one has 

0(5) = ^Tsallis = lnQ(exp(S'Renyi))- (38) 

Note that, taking (j)(x) = x in (|37|), one obtains 

1 _ df{S) 1 dS 



T dU 1 + {\- a)S dU' 

This coincides with the formula proposed in p9| . 



(39) 
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3.4 Discussion 



Combine ( |34|j36| ) with the above choice of function /(a;), one obtains 

1 _ aa(l3'{(3oU) 

" l + a{l-a){(j,{l3aU)-uy 

Make now the following choice of the parameter u 



u = (l){PoEm) with £'„ 

Then ( pO[ ) can be written as 

1 _ aa0'(/3oC/) 



min Ek ■ 

k 



(40) 



(41) 



(42) 



This is our general result for the relation between the parameter a and 
thermodynamic temperature T. From this expression it is immediately 
clear that, in case < a < 1, temperature T is always positive. The 
analysis of what happens if a > 1 is more complex and falls out of the 
scope of the present paper. In addition, if a tends to infinity, then U will 
become equal to E^. In this limit ( |4^ ) implies that temperature T goes to 
zero, as is physically expected. 

An open questions is wether T is always an increasing function of U, as 
it should be. This analysis is not made in the present paper. 

Next, use to eliminate a from expression ( |2^ ) for the probabilities 
Pk- One obtains 



Pk 



1 



1 



(b'iPoU)PoT 



{(f>ipQEk) - <f>il3oU)) 



l/(a-l) 



(43) 



with Cq, the appropriate normalization constant. The limit a = 1 of this 
expression is 



Pk 



1 



■ exp 



1 - 



1 



0'(/?of/)/3oT 



(44) 



If 4'{x) — X then this is the Boltzmann-Gibbs distribution. 



3.5 A special temperature 

Consider the case that 

a = 



1 + (1 - o)u 



Then (gj) simplifies to 



Pk = -exp„(-Xfc) 



(45) 
(46) 
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with C the appropriate normalization factor. Formula ( |42| ) becomes 

1 ^ a0'(/3ot/) 
/JoT 1 + (1 - a)0(/3of/) 



(47) 



In case of Renyi thermostatistics is (f> = (pa as given by (pO|). Then the 
r.h.s. of the above expression is identically equal to a so that one obtains the 
condition afSoT — 1. This shows that in this case the special temperature 
for which ( ^ ) holds is l/a/?o- One concludes that, because of the nonlinear 
averages, the value of /3o cannot be chosen arbitrarily. 

4 Nonlinear averages 

In the present section we try to clarify the role nonlinear averages can play 
in statistical mehanics. 

4.1 Large deviations 

A fundament of equilibrium statistical mechanics is the equivalence of en- 
sembles, known as the equipartition theorem in information theory. A 
related ||l8|, |l9|, ^ property is the asymptotic expression 

PTohi(3a{E -U)/NKe)r^ exp(-iVr(e)) (48) 

where r(e) is an appropriate rate function. With '/jv ~ exp(— A'^x)' we 
mean here that 

lim — ln/Ar = a;, (49) 

with a; « e we mean that x should ly in a small open neighborhood of e, of 
size independent of A^. This result from the theory of large deviations 
states that the probability that energy E deviates from its average value 
U is exponentially small in the size of the system A^. Physically, minus 
the rate function r(e) has the meaning of a relative entropy density in the 
microcanonical ensemble with energy U + Ne. The rate function r(e) is 
expected to be minimal for e = because the equilibrium state maximizes 
entropy under the constraint that average energy equals U. 

In case of identically distributed independent stochastic variables xi, 
X2, ■ ■ ■ with average value (x), the central limit theorem implies that 

^-^ ~ (x)) « 6 1 ^ e-^^'/'^'' (50) 

where is the variance of the distribution. Comparison with (^8|) shows 
that in this case r(e) = e^/2a^. In general, (Esl) will hold with different rate 
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functions. E.g., in the high temperature phase of the Curie- Weiss model, 
isU = 0{N°) and 



FToh{PoE/N « e) 



exp -N 



2e 
1^ 



e 



(51) 



(—/So J/2 < e < 0), with T the temperature, with J > the interaction 
constant of the model, and with 



Soo(m) 



i(l + m)ln(l + to) + i(l - to) ln(l - m) (52) 



(— 1 < TO < 1, see section (4.2) below). 

Let us now make the link with nonlinear averages. Fix a strictly in- 
creasing function (j){x). Then (MSh is equivalent with 



(53) 



PToh{(t){Po{E - U)/N) we) - exp{-Nr o ^-\e)) 



For thermostatistics it is important that the rate function r{e) is convex, 
at least in a neighborhood of e = 0. But clearly, that r(e) is convex does 
not imply that r o (f>^^[e) is convex. Conversely, if r(e) is not convex, as is 
e.g. the case in the Curie- Weiss model at low temperatures, then one might 
search for a function (p{x) such that ro (/)~^(e) is convex. This is a possible 
motivation for introducing KN-averages. The strictly increasing function 
4>{x) changes the scale used to observe energy fluctuations. The choice of 
function should be such that the resulting rate function is convex. 

For example, if we take 0(x) = sign(x)-\/Txf and apply rescaling to ( ^l|) 
we obtain 



Prob (^^-l3oE/N we) - exp(-7V s^o [^\f^ 



(54) 



Note that in the Curie- Weiss model the square root of energy is propor- 
tional to total magnetization M. In fact, it is tradition to study large 
deviations of the Curie- Weiss model in terms of M instead of E, in the 
context of a grand canonical ensemble instead of the canonical ensemble 
studied here. 

The link with Kolmogorov-Nagumo (KN) averages (see definition (|l])) 
is immediate, be it on an intuitive basis. We introduce probabilities pk of 
a canonical ensemble based on (|5^), i.e. after rescaling energy fluctuations 
with the function 0, because then we can expect equivalence of ensembles 
to be valid. The actua l ne ed for nonlinear averages, as given by (^, will 
be discussed in section 5.4. 
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4.2 The Curie- Weis model 



For a mathematical treatment of the Curie- Weiss model in the context of 
the Boltzmann-Gibbs formalism, see |l^, The energy of the model 
is given by 



= -—Ml 
2N " 



N 



hM, 



with Ma- = (Tj ■ 



(55) 



The spin variables aj take on the values ±1. The constant J is strictly 
positive. For simplicity, assume h > 0. 

Fix 7 > 0. Introduce an increasing function (t>{x) by 



(f>{x) = sign(a;)|a;|'''. 



(56) 



Because <j>'{x) is singular at x = when < 7 < 1 we have to be carefuU 
with the denominator (l)'{U) appearing in (^4|). Let m ~ M/N and v ~ 
—U/N and assume U < 0. There follows 



Pa 



— exp 

1 

77 exp 



M 

7T 



sign(£;„) 



U 



N (j){Jmy2 + hm) 



(57) 



Hence the probability of a given magnetization M is 



p{M) 



N 



1 



N+M J Qi exp 

exp(— 7Vr(TO)) 



N (j}{Jm^/2 + hm) 

7T 



with 



r(m) 



,(m) 



,1-7 



7T 



■(/)(Jto^/2 + hm) 



(58) 



(59) 



(see (^) for the definition of Sqo)- To obtain this asymptotic expression, 
Stirling's formula has been used. Note that ( p9[ ) is not in agreement with 
( plp4[ )! The result obtained here is (7 = 1/2) 



Prob 



(^^/-PoE/N 



exp{-N 



2 



/2Jv 



(60) 



The temperature T in this expression is defined by the thermodynamic 
relation (|3^), while in ( |5l| , |5^ ) it is a Lagrange multiplier. Since v = 0{1/N) 
throughout the high-temperature phase the second contribution to the rate 
function in (KG) vanishes in the limit of large N. 
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Assume r(m) is minimal at to = m*. The condition r{m^) = reads 



1 / 



Because v — Jml/2 + hm^, + 0(1/A^) holds, this simplifies to 

TO, = tanh ^ — (Jto» + h)^ . (62) 

This is the standard mean-field equation. It does not depend on the choice 
of 7. 

Let TO — TO, + e. Expansion of the exponent in ( |5^ ) around to,, assum- 
ing TO, > 0, gives 

p{M) ~ exp[-(l/2)iV/"(TO,)e2] (63) 

with 

= T^-T + (^-^)r jTO;/2 + feTO, 

Hence fluctuations do depend on the choice of 7. 

Let us now consider the case 7 < 1 and ft, = in more detail. Then 
TO, = is a solution of the mean-field equation at all temperatures. The 
corresponding value of v is of the order 1/N and can be neglected. Hence 



expression (59) reduces to the result of the ideal paramagnet. One con- 
cludes that, contrary to what happens in the 7 = 1-case, this solution is 
(meta)stable at all temperatures (a solution of the mean-field equation is 
said to be stable if it optimizes entropy for the given average energy, it is 
metastable if it is not stable but the rate function is convex in a neighbor- 
hood of the origin). 

5 Renyi thermostatistics 

Renyi's entropy corresponds with the choice (f> — (f>a (given by (^)) and / 
given by (|l0|), i.e. <j)a{x) — lnQ(e'^) and uj{x) = \ii{x). In the present section 
we study an obvious generalization of this entropy. It contains both Renyi 
and Tsallis thermostatistics as subcases. 

5.1 Definition 

Introduce the function 4>ap{x) given by 
(l>ap{x) = lnQ(expp(x)) 
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sign(:r)^([l + (l-p)|x|]t^"^'')-l) 



(65) 



Here, and in what follows, we use 9{a, p) as an abbreviation for (1 — a)/(l — 
p). Equation (65) is the obvious generahzation of ( pO| ) obtained by de- 
forming the exponential occurring in it. We use extended definitions of 
hia{x) and expp(a:), introduced in appendix. The inverse function of 4>ap 
is (/fpa. The entropy function is chosen in such a way that (22) holds. 
Now, (j)ap(u!{x)) = lnQ,(a;) implies ll}{x) — Inp(x). The average information 
content then equals 



1 



P 



,Pfc(/>ap(lnp(l/pfc)) 



1 



Sapip) 



(66) 



The two interesting limiting cases are p = 1 and p — a. If p = 1 the 
expression reduces to that of Renyi's entropies (|2^) . If p = a it reduces to 
the Tsallis expression (p^). Note that all entropies with the same a- value 
belong to the same equivalence class. The constraint {(3qE) — U can be 
written as 

^,Pk(f>ap{PoEk) = (f)ap{U) (67) 

It depends of course on the choice of both a and p. 



k 



5.2 Subsystems 

Renyi proved ^ that the only additive measures of information content 
are Shannon's entropy (11) and the family of Renyi a-entropies (^l|). Ad- 
ditivity means here that when a system is composed of two independent 
subsystems A and B then the entropy of the total system is the sum of the 
entropies of the subsystems. 

Let us consider additivity in more detail. A system consisting of two 
independent subsystems is described by probabilities pjk of the product 
form 



(1) (2) 
Pjk = P) 'Pl 



(68) 



A short calculation gives 



{I) 



(1,2) _ 



E 

jk 



P,k 



- 1 
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(69) 



If p = 1 this implies additivity of Renyi's entropy functional, li p — a this 
result is well-known in the literature about Tsallis' thermostatistics. 

Next assume that the relation between the energy functional of the 
composed system and those of the subsystems is 



PoE^l'^^ = Inp (expp(/3oi?f ^) expp(/3o£;f ) 



(70) 



For p = 1 this simply means that E^]:^^ = Ej^^ +eI^\ For arbitrary p (| 



implies that 

</)op(/3oi?]fe''') = ln„ (expp(/3o£;f ^)expp(/3oi?f^) 



(71) 



so that 



1 


1 




a 




1 




1 




a 




1 




1 




a 



Hp? T.Pk^ ( [expp(/3oi?f ^)expp(/3oi?, 



(2). 



Epy expp(/3oi?f E^i'' expp(/?o4'V-" - 1 

j k 

^p«(l + (l-a)0„p(/5o£:f ) 



with 



X (1 + (1 - a)0ap(/3o4") - 1 

x(i)+x(2) + (l-a)X«x(2) 



(72) 
(73) 



On the other hand is 



{(3oE('Y) + (/3oi?(^V'' + (1 - pmE('YHPoE^'^)<-'^ 
= lnp(expjX(i))) + lnp(cxp 

+ (l-p)ln,(expjX(i)))lnp(expjX(2))) 

= lnp{cxp„(X(i)) exp^ ( A ^ ' ) > 
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- In, 




(74) 



Combination of the two expressions gives the result 



{P,E) = (/3„£;(i))(i) + (/3oi?('V'^ + (1 - p)(/3o£;(^¥'^(/3oi?('V'^ (75) 



For (0=1 this proves the additivity for energies as expected. When p ^ 1 
this is a nontrivial result, even if p = a, in which case the average is linear. 

In general, equilibrium probabilities of generalized thermostatistics are 
not of the product form. Therefore, thermodynamic entropy 5, which is 
the maximum value attained by (/) is not necessarily equal to the sum of 
thermodynamic entropies of the subsystems. Of course, the product form 
is also absent in the standard formalism of thermostatistics when there are 
correlations between subsystems. If correlations are not too strong then 
the system in equilibrium is still extensive, by which is meant that ther- 
modynamic entropy S and internal energy U grow linearly with the size 
of the system. This is usually expressed by stating that the so-called ther- 
modynamic limit exists. We expect that also in the present case (i.e. with 
(j)a given by ( |20| ) and / given by (Q) thermodynamic limit exists under 
suitable conditions. This point has still to be studied. 

5.3 Two- level system 

As an example, consider a system with two energy levels Eq and Ei. As- 
sume Eq — Q and i?i > 0. Let u = 0. Then one has 

xq = (f)ap{/3oEo) = 4>apiO) = and 




(76) 



From ( p^ follows pa = I /Zi and 



pi =po [1 + a(l - c()xi] 



-1/(1-") 



(77) 



+ 



Note for further use that 





(78) 



From 



0a,p(/3oC/) 



PO0a(O) +Pi(/>q(1) ^PlXi 



(79) 
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follows 



l~PO^Pl = — (j)a,p{f3QU) 
Xl 



L^([l + (l_p)/3oC/]^(-P)-l). (80) 



1 + (1 - a)x_ 

This is the solution of the optimization problem maximizing information 
content given the constraint that {(3qE) = PqU. 

Entropy can be calculated using (|7q). One obtains 

^ (K+P?-i) (81) 



l-a 



so that 

S = ^((p;^+p?)^(P^")-l). (82) 

The optimum as a function of temperature is calculated now. From 
definition ( ^) of thermodynamic temperature, with f{x) given by (|37|), 
there follows 

1 _ ^apiS) dS 



f3oT l + (l-a)0„^(5) d/3oC/ 

1 dS 

1 + (1 - p)S d/3ot/ 
a pr^-pg"^ dpi 



l-a pg + dpoU 



(83) 



with 



dpi 1 



dPoU Xl 
1 



-^(l + (l-a)pia;i)(''-")/(i-") 

(expjpixi))''-". (84) 



Xl 

Let us shortly discuss these results. 

The condition T > requires that pi < po holds. The latter is also 
expected on physical grounds. The equality pi ~ po is reached when tem- 
perature T goes to infinity. Expansion for small pi, assuming < a < 1, 
gives 

PoT ~ ^xipj-" (85) 
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0.4 



0.8 



1.2 



1.6 



Figure 1: Occupation of the excited level as a function of aT, for p = I, 
PoEi — 1, and a = 0.5 (solid line), resp. 1.0 (dotted), and 2.0 (dashed). 

This means that for small temperatures the population of the excited level 



Below Tg is pi =0. Due to the cutoff in the probability distribution the 
first excited level is not populated at all. See Figure 1. 

In the special case a = 1 one obtains directly from ( [76| , [77| ) that pi = 
Po expp(/3o-Ei). The corresponding temperature is calculated using (p^ ) and 
satisfies 



In particular, as noted earlier, if p = 1, then the special temperature satis- 
fies aPoT — 1. Hence, as a funtion of aT, curves plotting pi as a function 
of T for different a, but p = 1, all intersect in the same point — see Figure 
1. 

5.4 Why nonlinear averages? 

We now show how nonlinear averages arise in a natural way. 
Note that 




/3oT= -(l + (l-p)/?of/) 



(87) 



MME-U)/N) 



^(l~a)PoiE„~U)/N _ 



1 — a 
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g(l-a)/3oS/iV _ g(l-a)/3oC//7V 
(1 - Q,)e(l-")/3oC//JV 

with l — aN = {l — a)/N. In particular, the equation (j)a{Po{E — U) / N) « e 
appearing in the l.h.s. of (p3|), can be rewritten as 

0„„ (/3oi?) - (/3of/) « (/3ot/) e. (89) 

This imphes that ( p3[ ) can be written as 

Prob((0„„(/3oi;) - (bc.^{PoU))/N « e) ~ exp(-Arr,(e)) (90) 

with 

r4e)^ro^~^{e/cl)MU/N)) (91) 

The problem is now to find a probability distribution for which ( po|) 
is satisfied. A candidate solution to this problem is obtained by maximizing 
entropy with the constraint that 

5Zp^0a„(/3o£^^) = 0a„(/3oC/) (92) 
The latter condition can be written as 



iPoE) =(3oU (93) 

with average (•) calculated as a KN-mean w.r.t. the function This 
justifies the use of nonlinear KN-averages. 

Remains the question which entropy should be optimized. We believe 
that in many applications the right choice is either Shannon's entropy or 
an a-entropy of Renyi (or an equivalent entropy like that of Tsallis). The 
reason for this belief is that these are the only entropies with nice additivity 
properties. 



5.5 Renyi thermostatistics of the Curie- Weiss model 

We reconsider the Curie- Weiss model, now with the function ip equal to 
(j)a,i = 4>a- It is not the intention here to treat this model in detail, with 
arbitrary values of a and p. We rather want to to discuss the thermody- 
namic limit in the extensive p — 1-case. 

Introduce the notations m = M/N and u — U/N, and 

7(m) = exp((l - a)/3o(JmV2 + hm + u)). (94) 
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The equilibrium probabilities are, assuming < a < 1, (see (^) and the 
discussion of the previous subsection) 



piM) 



N \ 1 
N \ 1 



ajv + (1 - ctN) 



Von 



l/(ajv-l) 



-N/{l-a) 



with 



exp 



s(m) 



Ns{m) 



1 



I- a 



■In 



1 - 



1 



1 - 



7(m) 



(assuming T > J). The first derivative of s(m) is 

l3o{Jm + h) 



I\m) 



m 



2 1 - m 1 + (/3oT- l)7(m) 



(95) 



(96) 



(97) 



Let TO* be the value of m for which s(to) is maximal. It is a solution of 
I'{m) ~ 0. Hence it satisfies 



= tanh 



/3o(J"^* + h) 
l + (/3oT-l)7(mO 



(98) 



But 7(m,) = 1 + 0(1/A^) holds for large N, so that the previous equation 
simplifies to the standard mean-field equation (|62|). 

In case T > J the rate function s(to) does not differ significantly from 
the standard result Soo{m) — Jm? /2T. If T < J the difference consists in 
the first place of a constant term. All together, the present treatment of 
the Curie- Weiss model seems to yield results which are very similar to the 
standard one. 



6 Discussion 

We have presented a formalism of thermostatistics based on Kolmogorov- 
Nagumo averages and Hartley's information measure. It treats the problem 
of maximizing average information (/) under the constraint that average 
energy (PqE) has some given value PqU. The KN-average (•) is determined 
by a monotonic function (j) and by probabilities pk- The same KN-average 
is used to define average energy and to define entropy as an average infor- 
mation content. In the present paper the function is kept constant while 
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the probabilities pk are varied. It could be interesting to consider cases 
where (j) is varied as well. 

Of particular interest is the case when (f> is an exponential function 
because then the average information (/) coincides with Renyi's entropy. 
As proved by Renyi, his a-entropies, together with that of Shannon, are 
the only additive ones. Instead of exponential (j) we use a slightly modified 
function (pO|), for which (/) is still Renyi's entropy. We call the statistical 
formalism obtained in this way Renyi thermostatistics. It generalizes the 
standard formalism of Boltzmann-Gibbs. Note that Renyi thermostatistics 
is extensive in the sense that, when a system is composed of independent 
subsystems and the probabilities are of the product form, then both average 
information and average energy of the total system are the sum of the 
corresponding subsystem averages. From the examples of the Curie- Weis 
model (section 5^) and the two-level model (section 5^) it seems that 
Renyi thermostatistics behaves quite similar to the standard formalism. 
The most striking difference is the energy cut-off which can occur in case 
a > 1. When this happens then the probability of occupation of high- 
energy levels is exactly zero. In particular, this kind of equilibrium state is 
not a Gibbs measure. 

We have tried to justify the use of KN-averages in statistical mechanics. 
In essence, a KN-average consists of a change of scale by means of a func- 
tion (j), followed by a statistical average using probabilities pk, and a final 
back transformation using the inverse function A basic ingredient of 
statistical mechanics is the equivalence of ensembles. A possible reason for 
combining the canonical ensemble with a change of scale is that in this 
way the equivalence of ensembles might be restored. We did not consider 
equivalence of ensembles directly, but argued on the level of fiuctuations of 
energy. These should be exponentially small in order to apply the theory 
of large deviations. It is intuitively clear that a change of scale influences 
the scale of fluctuations. This is indeed confirmed by calculations on the 
Curie- Weiss model which we used as an example. Now, if the function (jj 
is the function involved in the definition of Renyi's entropy, then scaled 
energy fluctuations can be expressed as fluctuations of the scaled energy. 
In this way we can justify that scaled energy is the relevant quantity of the 
canonical ensemble. 

In a natural way Tsallis' entropy appears as a tool for calculating equi- 
librium averages in Renyi thermostatistics. This offers the opportunity to 
reuse knowledge from Tsallis-like thermostatistics. It shines also new light 
onto Tsallis' nonextensive thermostatistics because there is a one-to-one 
translation of problems from the one formalism to the other. There are 
many applications of the Tsallis formalism — see e.g. the review paper 
[ p5| . An open question is how many of these applications have a natural 
formulation in terms of Renyi thermostatistics. As a first guess this would 
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be the case whenever extensivity is important. 

We consider also a further generahzation of extensive Renyi thermo- 
statistics to a non-extensive formahsm. This extended formalism includes, 
besides the extensive case, also nonextensive Tsallis thermostatistics. In 
particular, an argument has been presented which settles the problem of 
defining thermodynamic temperature in a nonextensive context. 



A short version of the present paper has appeared in 
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Appendix 

Definition ( p^ ) of the a-deformed logarithm hia{x), without the condition 
a; > 1, is what is found in literature The inverse function exp^(a;) 

is not defined for all x, while we need it for all x in section |^. Therefore we 
propose the following extended definition 

if x > 1 

if < a; < 1 (99) 

(we assume a > 0, a ^ 1). It satisfies \na{l) — 0. The deformed logarithm 
lnQ,(a;) converges in the limit a = 1 to the natural logarithm ln(a;). 
The derivative of Iuq, (x) is a continuous positive function given by 

\na{x) = if a; > 1 

dx 

= ifO<2:<l (100) 

In particular, this implies that lnQ,(a;) is a strictly increasing function of x. 
Note that 

— < \na{x) < if a > 1 (101) 

a — 1 a — 1 

while for < a < 1 the deformed logarithm takes on any real value. Hence, 
the inverse function exp^, (x) has a limited domain of definition in the former 
case. One finds 

exp„(2;) = [1 + (1 - a)a;]^/^^-") if a; > 

= [1 + (a - l)a;]y^""^^ if a; < (102) 



x^^" — 1 

lnQ(a;) = — 

1 ~ a 

a:"~i - 1 
a — I 
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Clearly, exp^(O) = 1 and cxp^(x) cannot be negative. In the limit a = 1 the 
deformed exponential function exp^^ {x) converges to the usual exponential 
function exp(a;). 

The derivative of ex.p^{x) 

^exp„(x) = (exp„(x))" if x > 

= (exp„(x))^"" ifx<0 (103) 

is a positive continuous function. In particular, this implies that eyip^{x) 
is a strictly increasing function on its domain of definiton. 
Finally, note that 

exp^(— a;) = l/exp^(a;) and lna(l/a;) = — lnQ,(a;). (104) 
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